Parametrisation of the speech space using the self-organising neural network
نویسندگان
چکیده
Speech recognition is a diicult problem due to the inability of current systems to cope with connected speech. Neural networks are able to learn some aspects of this task. An unsupervised learning scheme like the self-organising map can be used to both classify and order the speech sounds and provide a front end to higher level processing. A map of phonemes (phonotopic map) is used to trace trajectories of sounds from utterances. The self-organising map provides a means of reducing the inherent dimensionality of the speech data. A crinkle factor which is used to determine how close the dimensionality of the map is to the dimensionality of the speech input shows that speech has an inherent dimensionality of at least three or four. A projection of the map and the speech data shows how the self-organising map ts the speech space.
منابع مشابه
Speech Processing Using Artificial Neural Networks
A three layer perceptron network is used to classify the /i/ sound using isolated words from diierent speakers. A classiication accuracy of 97% has been achieved. A map of phonemes is used to trace trajectories of utterances using the self-organising neural network. A crinkle factor is proposed which allows using the self-organising map to determine the inherent dimensionality of a set of point...
متن کاملGeneralisation and discrimination emerge from a self-organising componential network: a speech example
It is demonstrated that a componential code emerges when a self-organising neural network is exposed to continuous speech. The code’s components correspond to substructures that occur relatively independently of one another: words and phones. A capability for generalisation and discrimination develops without having been optimised explicitly. The componential structure is revealed by optimising...
متن کاملبهبود عملکرد سیستم بازشناسی گفتار پیوسته بوسیله ویژگیهای استخراج شده از مانیفولدهای گفتاری در فضای بازسازی شده فاز
The design for new feature extraction methods out of the speech signal and combination of their obtained information is one of the most effective approaches to improve the performance of automatic speech recognition (ASR) system. Recent researches have been shown that the speech signal contains nonlinear and chaotic properties, but the effects of these properties are not used in the continuous ...
متن کاملDeveloping A Fault Diagnosis Approach Based On Artificial Neural Network And Self Organization Map For Occurred ADSL Faults
Telecommunication companies have received a great deal of research attention, which have many advantages such as low cost, higher qualification, simple installation and maintenance, and high reliability. However, the using of technical maintenance approaches in Telecommunication companies could improve system reliability and users' satisfaction from Asymmetric digital subscriber line (ADSL) ser...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کامل